CDS

Accession Number TCMCG021C38844
gbkey CDS
Protein Id XP_029119884.1
Location join(36972822..36972850,36972943..36973853,36974597..36974838,36974916..36975119,36976897..36976989,36977094..36977201,36977457..36977840,36977923..36978009,36980673..36980723)
Gene LOC105043294
GeneID 105043294
Organism Elaeis guineensis

Protein

Length 702aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268357
db_source XM_029264051.1
Definition uncharacterized protein LOC105043294 isoform X1 [Elaeis guineensis]

EGGNOG-MAPPER Annotation

COG_category S
Description KAT8 regulatory NSL complex subunit
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
KEGG_ko ko:K07020        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGATAAGAGTTGGATTCATCAATCAAGAGAGAGTGATGCATACTTGGAAGGAGTTGGTCGATTCCTCGATTTTGCATTTGACAAAGCTGCACAGAGGGGGCTTATCCTTTGCCCATGTAGAAAATGCAACAACTGTTATTGGAAGAATCGAGAAGATGTGTATGAGCATCTGACATGTGATGGTTTTATGAGGAATTATAGCCATTGGATTTTCCATGGGGAAACTTTCTTAACTTCTACATCTGCTAATGCCCCACATTCAGACTCACCACCTCTTACGTACTCTACCATGAGTAACAAAAAGTCACCTAGGTGCCTGGTGGGTCCTATGGCCAGGATTGTTTGTCTGCAGCAAAAAGATCAGTCAGGTCCTAACCAGCCACAAAGCCAGCAGATCCAGCAGTCCTCACCAGCCTACAGCCAGCCACTTTACCCACACGATGAACCTAGCTTGGCCCAGTTGAGTTGTATCCAGCCATATGAAACTCAATTGCCTCAAGGGAGCTTCAGTGAACCACATCAGACACAATCACAACAGTCGAGCTCCAGCCAGCCACCTCAGACCCACTCACCTCAGCAGAGCTCTAGCCAGCTACATCAGACCCAGGTAAGGTCTAGCTATCTGGATCAGTCACTGCCAGCTCCACCACCACCAAGCTCTGGCCAGTCAGATCTGGCACAGCCAATGCAGTTAAGCTTCAACAAGCTAGAACAGATACAGTCAGACCAGCCAAGCTTCAGCCAGCTAAATCAGATGCAGCTGGCTCAACTGAAGTCCGGTCAGCCAGGGCAGATGCAACTGCTCCAGCCTAGCTCCAGTCAGCCAGATCAGATGCAGCTGACCCAGCTGAGTTCCAGCCTACTAGATCAGAAGCATTTGGCTGAGCCAAGCTCTAAAGAACCACAAGCCTCCCGGCCATGGATTGGTGATGATACTGGTTCTGCCAGAAAACGAAGGGGTCGAGGTCCTACACGGTGCCTTGATGTATGGAACAGTCCTGAGGGTCAGCACATACGTGTTGCTTTCAACAACCTTGGCCAGCCTATCGGCCGGAAGGCAGCAAAGCTAAGCAACTTCTTGGGTACTATAGCACGAGATGGGCACTTGGCACCCCTTAATTTTATTGATTGGAGGGCTGTGCCAGATGGATCCAAGGAGAAGATGTGGCAGCTTGTTGAGTCAAAGTTTGACATTGACCCTATTGGTAAAGATTGGGTCTTGAAGTCCTTGGGTACGAAATGGAGGAACTGGAAAGCTGAGTTAAAGGCTGCCCATTATGATACTCACAAGACTGATGAGGAGCGGCTTGCTGATTGTGATAAGAGGGTTGTACCAGATCAGTGGCCATTTCTTGTGGCGTATTGGAGCTCTAAAAAGGGGAAGGCACGTAGTAATACCAACAGGGCTAATCGTGTGCATCTGAGGTTTGGCCATACTTCTGGCACAAAGAGCTTTGCACGTATACGTGAAGAGGAGCGGGTGAAGAGGCCTGATGGGCAGGAGCTATCTCGGGCAGAGCTTTTTGTATTGACTCATACACACAAAGATGGGCAGCCCATGGATGAGGCTTCACTGGAGGCAATTTCACAACTTCATGAACAAGCAAGACAGCAAGCTGAGAGCTCACATGGTAGCATTGATTGGGATTATGTATTCTTTCAGGTCATGGGAGAAGAAAAGCCTGGCCGTCTGCGTACATATGGGTTGGGTCCTTCTCCCTCTGATATCTATGGCCCACGACCAACCCGCAGTGAGGCCATGAAAATGGTTTCAGAGGCTAAGAAGGCTGCTGATGAGGAGGTTCGCATGATGAAGGAGAAAATGAATGCTATGGAGCAAAAATATACTGAGATGCAAACTCAGATGACTATGATGATAACGAGAATGGAGGCTATGCATAAGAGGTTCCTTGATGAGCAGTTGTCTGATAATACAGGTGCACCATCAGAGCCTTTGGGTTCCAGACAGGCTCCTGACACCTCCAGTGCTCAGGAAGCTTTGCAGCAGTCGCAGGCACATTCCTTATTAGCAGGTCATGCAGATCCATCTGATGAGGGACGAGCAGCCAAGAGGGGAAAGGCGCACAGGGTCCTTAGGACAAGGTAA
Protein:  
MDKSWIHQSRESDAYLEGVGRFLDFAFDKAAQRGLILCPCRKCNNCYWKNREDVYEHLTCDGFMRNYSHWIFHGETFLTSTSANAPHSDSPPLTYSTMSNKKSPRCLVGPMARIVCLQQKDQSGPNQPQSQQIQQSSPAYSQPLYPHDEPSLAQLSCIQPYETQLPQGSFSEPHQTQSQQSSSSQPPQTHSPQQSSSQLHQTQVRSSYLDQSLPAPPPPSSGQSDLAQPMQLSFNKLEQIQSDQPSFSQLNQMQLAQLKSGQPGQMQLLQPSSSQPDQMQLTQLSSSLLDQKHLAEPSSKEPQASRPWIGDDTGSARKRRGRGPTRCLDVWNSPEGQHIRVAFNNLGQPIGRKAAKLSNFLGTIARDGHLAPLNFIDWRAVPDGSKEKMWQLVESKFDIDPIGKDWVLKSLGTKWRNWKAELKAAHYDTHKTDEERLADCDKRVVPDQWPFLVAYWSSKKGKARSNTNRANRVHLRFGHTSGTKSFARIREEERVKRPDGQELSRAELFVLTHTHKDGQPMDEASLEAISQLHEQARQQAESSHGSIDWDYVFFQVMGEEKPGRLRTYGLGPSPSDIYGPRPTRSEAMKMVSEAKKAADEEVRMMKEKMNAMEQKYTEMQTQMTMMITRMEAMHKRFLDEQLSDNTGAPSEPLGSRQAPDTSSAQEALQQSQAHSLLAGHADPSDEGRAAKRGKAHRVLRTR